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ABSTRACT 

The data broadcast problem is to find a schedule for broad- 
casting a given set of messages over multiple channels. The 
goal is to minimize the cost of the broadcast plus the ex- 
pected response time to clients who periodically and proba- 
bilistically tune in to wait for particular messages. 
The problem models disseminating data to clients in asym- 
metric communication environments, where there is a much 
larger capacity from the information source to the clients 
than in the reverse direction. Examples include satellites, 
cable TV, internet broadcast, and mobile phones. Such en- 
vironments favor the "push-based" model where the server 
broadcasts (pushes) its information on the communication 
medium and multiple clients simultaneously retrieve the spe- 
cific information of individual interest. This sort of environ- 
ment motivates the study of "broadcast disks" in Information 
Systems [1; 7]. 

In this paper we present the first polynomial-time approxi- 
mation scheme for the data broadcast problem for the case 
when W = O(l) and each message has arbitrary probability, 
unit length and bounded cost. The best previous polynomial- 
time approximation algorithm for this case has a performance 
ratio of 9/8 [6]. 



1. BACKGROUND AND RESULT 

The input is a set M = {Mi,..., Mm} of messages, 
each with a probability pi and cost Ci , and a parameter 
W — the number of channels. The output is (finitely 
described) infinite broadcast schedule S for the messages 
— specifying for each time t = 0, 1, 2, . . . and channel k, 
a message S(t, k) (if any) to be broadcast at that time 
on that channel. The goal is to minimize the cost of the 
schedule, denoted COST(S) and defined as the expected 
response time plus the broadcast cost of S. 
For a finite schedule S, the expected response time of 
S, denoted ERT(S'), is defined as follows. At each 
time unit, each message is requested by some client 
with probability pi. Once a message is requested, the 



client waits until the next time at which the message is 
scheduled on any channel (or the end of the schedule, 
whichever comes first). ERT(S) is defined to be the ex- 
pected waiting time for a random request at a random 
time. The broadcast cost of S, denoted BC^S 1 ), is de- 
fined to be the total cost of scheduled messages, divided 
by the length of the schedule. 

Throughout the paper, if any real-valued function / is 
defined with respect to finite schedules, then we im- 
plicitly extend it to any infinite schedule S as follows: 
f(S) = limsup n _ >00 / where S n denotes S re- 
stricted to the first n time slots. Thus, the above def- 
initions of expected response time and broadcast cost 
implicitly extend to infinite schedules. All of the infi- 
nite schedules considered in this paper will be periodic, 
in which case this extension is particularly simple. 
The data broadcast problem and special cases were 
studied in [2; 3; 4; 5; 6; 10; 16; 19; 23]. Works studying 
applications and closely related problems include [1; 7; 
9; 11; 12; 13; 14; 15; 17; 18; 19; 20; 21; 22; 24]. Some 
of the above works study the generalization allowing 
messages to have arbitrary lengths, which we do not 
consider here. 

Ammar and Wong [3; 4] proved that there always ex- 
ists an optimal infinite schedule with finite period. They 
also formulated a natural relaxation of the problem that 
gives an explicit lower bound on the optimum; the per- 
formance guarantee in this paper is proven with respect 
to that lower bound. More recently, const ant- fact or 
polynomial-time approximation algorithms have been 
shown [5; 6], the best to date being a 9/8-approximation 
[6] . Although the problem itself is not known to be NP- 
hard, several variants are known to be [6; 16; 19]. 
Khanna and Zhou [18, §1.2] state that it is unknown 
whether the problem is MAX-SNP hard, even when 
W = 1 and without broadcast costs. In this paper, 
we show that it is not (unless P=NP). We present 
the first deterministic polynomial time approximation 
scheme for the problem, assuming the W and each cost 
is bounded by a constant. By "polynomial time", we 
mean that the time taken to output the finite descrip- 
tion of the infinite schedule is polynomial in the number 
of messages m in the input. 



2. SUMMARY OF APPROACH 

Our algorithm is based on a simple new observation 



that works for a special case of the problem. We use 
fairly technical but to some extent standard techniques 
to extend it to the general case. We sketch the idea 
here, glossing over a fair amount of technical detail. 
Ammar and Wong [3; 4] relax the optimization problem 
by allowing messages to (a) be scheduled at non-integer 
times and (b) to overlap, while still insisting that the 
total density of the scheduled messages is at most W, 
the number of channels (the extension to the multiple 
channel case is due to [6]). The density of a message (or 
set of messages) is the total number of scheduled times, 
divided by the length of the schedule. Standard calculus 
yields a solution to this relaxed problem. The solution 
specifies for each message Mi a density d* , meaning that 
the message should be scheduled every r* = 1/d* time 
units. 

Ammar and Wong describe the following simple ran- 
domized rounding algorithm for producing a real sched- 
ule: For t = 1,2,..., for k — 1 . . . W , choose a sin- 
gle message Mi randomly so that Pr{Mi selected} is 
d*/W = W/t*; schedule Mi in schedule slot S(t,k). 
They observe that the expected waiting time for a ran- 
dom request for Mi is essentially r* in this schedule. 
Since the expected waiting time in the relaxed sched- 
ule is essentially r*/2 (because an average request falls 
midway between two successive broadcasts of Mi), this 
yields a 2-approximation w.r.t. expected response time. 
Since the expected broadcast cost of S is the same as the 
broadcast cost of the relaxed solution, the algorithm is 
a 2-approximation algorithm w.r.t. the total cost. Am- 
mar and Wong also describe a greedy algorithm that 
Bar-Noy, Bhatia, Naor and Schieber generalize in [6] to 
the multiple channel case and prove to be essentially a 
derandomization of the randomized algorithm, with the 
same performance guarantee. 

Round-robin within groups. Since our goal is a 
PTAS, we naturally group messages that are essentially 
equivalent (i.e. have essentially the same cost and prob- 
ability). Our simple idea is the following variation of 
Ammar and Wong's rounding scheme, which is most 
simply described as follows: Schedule the messages as 
Ammar and Wong do, but then, within each group, 
rearrange the messages so that they are scheduled in 
round-robin (cyclic) order. The broadcast cost is un- 
changed, but the expected response time improves as 
follows. Whereas before, a random request for a mes- 
sage M in a group G would have waited (in expectation) 
for \G\ messages from G until finding its message, in the 
round-robin schedule, a random request for M will wait 
(by symmetry) for (1 + 2 + • • • + \G\)/\G\) = (|G| + l)/2 
messages from G. That is, the expected wait in the 
round-robin schedule is (|G| + 1)/(2|G|) times the ex- 
pected wait in the Ammar- Wong schedule. Since the 
Ammar- Wong schedule has performance guarantee 2, 
the round-robin schedule has performance guarantee 
max G (|G| + 1)/\G\ = 1 + l/min G \G\. Thus, when the 
groups are all large, the Ammar- Wong relaxation is es- 
sentially tight. 

Extending to the general case. Recall that for our 
purposes a group is a collection of messages with ap- 



proximately (w.r.t. e) the same probability and cost. As 
long as each group has size at least 1/e, the round- robin 
schedule gives a (1 + e)-approximation. 
To extend to the general case, we show the following. 
Any set of messages can be partitioned into three classes 
as follows: 

A — A constant number of important (high proba- 
bility) messages. 

B — Messages belonging to large groups. 

C — Leftover messages, contributing negligibly to 
the cost. 

The basic intuition for the existence of this partition is 
that, due to the rounding, the message-probabilities of 
the successive groups decrease exponentially fast. Thus, 
for all but a constant number of groups (where the 
message- probability is high), either the group is very 
large, or the total probability of the messages in the 
group is very small. Althouth the intuition is basic, ob- 
taining the proof with the appropriate parameters is is 
somewhat involved and delicate. 
Once we have the partition, we proceed as follows: 

1. Find the density a of messages in A in a near-optimal 
schedule of A and B. 

2. Compute an optimal "short" schedule Sa of A hav- 
ing density approximately a. 

3. Schedule the messages in B in the slots not occupied 
by A, using the group-round-robin algorithm. 

4. "Stretch" the schedule, interspersing empty slots ev- 
ery 1/e time units, and schedule the messages for C in 
these empty slots. 

Note that in order to "cut and paste" the schedules 
together, we have to explicitly control the density of A 
and B. This in itself requires little that is new. The 
main new difficulty is the following. In step 3, we are 
using the round-robin algorithm to schedule B, but in 
a schedule that is already partially filled by A. For 
the analysis of the round-robin algorithm to continue 
to approximately hold, we require that the empty slots 
in schedule Sa are sufficiently evenly distributed so that 
the scheduling of B is not overly delayed at any time 
(cost increases quadratically with delay). 
A-priori, imposing this additional requirement on Sa 
might increase the cost of Sa too much. To show that 
this is not the case, we show (using a non-constructive 
probabilistic argument) that there is a schedule of A 
that has constant-length period, density approximately 
a, and cost approximately the cost of any optimal sched- 
ule of A with density a. Since the period of this sched- 
ule is small, the empty slots are necessarily evenly dis- 
tributed. 

The final output of the algorithm is a finite (size lin- 
ear in the input size) description from which an infinite 
schedule with approximately optimal expected cost can 
be generated by a randomized algorithm in an "on-line" 
fashion, where each step requires 0(W) time to sched- 
ule. 

The running time of the various steps is as follows. In 
step 1, only a constant number of densities a need to be 



considered: we can try them all and take the best. For 
each a, the time for the remaining steps is as follows. 
Step 2 can be done in constant time since the schedule 
we are looking for has constant length. Step 3 can be 
done in randomized time in the size of the output. Step 
4 can also be done in randomized linear time in the size 
of the output. 

The final technical hurdle is showing that the algo- 
rithm can be derandomized (extending the analysis of 
the greedy algorithm by Bar-Noy, Bhatia, Naor and 
Schieber to this more complicated setting). The result- 
ing deterministic algorithm outputs a polynomial-length 
schedule, the repetition of which gives the desired near- 
optimal infinite schedule. 



Proof. A message of Gj is broadcast during a time 
slot with probability 1/r, , then the average density of 
the group G 3 is then . Then: E[BC(S)] = ]T\ cj/tj . 
As explained above, a request for a message in Gj waits 
on average 1/2 until the end of the current time slot 
and then (gj + l)/2 broadcasts of a message in Gj on 
average. Then: E[ERT(S)] = \ + 12jP]9j T j SA T^- 
Finally if r = r* , then since Y2jPj9j = 1 an d t* ^ 1, 

we get that: E[COST(S)] < £ .(p^Si+l^ + Cj/r*), 
hence the claimed performance ratio. □ 



Remark 1 Note that the law of large numbers implies 
that the expected cost is obtained with probability 1. 



3. GROUP ROUND ROBIN 

Let the set of messages M be partitioned into groups 
Gi , . . . , G q where group Gj has size gj every message 
in Gj has the same probability pj and broadcast cost 
Cj . Let a be the desired maximum density of M in the 
schedule. In this notation, Ammar and Wong's relax- 
ation of the problem is: 



LB(M, a) 



1 2 
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Subject to: — < aW 



Lemma 1 (Lower Bound [3]) The minimization 
problem LB(B,a) is a lower bound to the contribution 
of the messages of B to the cost of any schedule S over 
W channels, in which B has density ^ a. 
The problem has a unique solution r* satsifying: 
9jTj = \J (2cj + \*)/pj, for some A* > 0. If 
12 j e b 9 i V Pi / (2 C 3 ) ^ aW, then A* = 0; otherwise, 
A* is the unique solution to: 12j^B \ZPi/(^ c i + ^*) ~ 
aW. 



Lemma 2 (Randomized Approximation) In the 

setting of this section, the randomized algorithm 1 con- 
structs a one-channel schedule S whose cost satisfies: 



E[COST(S)] 



E 

2 = 1 



If t is chosen in order to minimize LB(Af, 1), then 
algorithm 1 is a maxj(l + 1/ gj)- approximation. 



Algorithm 1 Group round-robin algorithm 

if E'=i !M < 1 then 

. Add a dummy group Go with po = Co = go = 

and 1/to = 1 - 12 9 j= i 1/Tj- 
for t — l-.co do 

. Draw at random a group Gj with probability 1/t,- . 

Schedule in time slot t, the next message of group 

Gj in Round Robin order, if j 0; and stay idle 

otherwise. 



4. SCHEDULING A AND B 

Next we treat the case where the set of the messages M 
can be partitioned into two sets A and B such that 

• A consists of a constant number of messages 

• B is partitioned into groups as in the previous 
section, such that each group has size at least 
«:(e)|A| 2 , where k = k(e) will be defined later. 

Recall from the discussion in the introduction that the 
challenge at this point is to show that there is a near- 
optimal schedule of A with the appropriate density a 
and in which the empty slots are relatively uniformly 
distributed. If so, then we can find the desired schedule 
for A by exhaustive search, and then schedule B into 
the empty space in the schedule using the round-robin 
algorithm previously described. 

To show the existence of the desired schedule for A, we 
show there is a near-optimal schedule of A with the ap- 
propriate density and with constant period (independent 
of a). 

Lemma 3 Given a set of messages A, with cost at most 
C, some constant < £ < 1 and a density < a < 1, 
there exists a periodic schedule S satsifying: 

1. The density of empty slots is S is approxi- 
mately (1 — a): 

1 -a(S) > (1 -e)(l -a), and a(S) €]0, 1[ 

2. The cost of S is approximately optimal: 

COST(S, A) < (1 + e) OPT(A, a) + e/2 

3. The period Ts of S can be bounded: 



Proof sketch. Our proof uses the probabilistic 
method. The main new, simple idea, is in the con- 
struction, which efficiently smoothes the cost function 
by erasing its possible wide variations over time in the 
particular schedule under study. 

Let T be a parameter to be determined later. Let S* 
be a periodic schedule of A with density a and which 
is nearly optimal: basically, COST(S , *,,4) ^ (1 + 



e)OPT(A,a). Let T* denote the period of S* , which 
w.l.o.g. is a multiple of T. 

From S* , construct another periodic schedule S2 by in- 
serting in S* , every T steps from a random starting 
point, all the messages of A in a fixed order. 5*2 is thus 
structured into blocks of length T + \ A\. Let S„ be the 
random schedule obtained by concatenating n blocks 
chosen at random from S2. For suitable values of T 
and n, we can prove that with positive probability, 
S n satisfies the first two statements of the Lemma. The 
period of S n is clearly n(T + \ A\), which together with 
the choice of T gives the third statement of the Lemma. 
□ 

401n(l + 4/e) t 

Notation 1 Let Tie) = — y, r^rnax(C, 1) de- 

v ; e 4 (l-e/6) v ' ; 

note the bound of the lemma for period (S 1 )/^ 2 . Pa- 
rameter k will be defined as k(e) =def 2WT(e)/e. 

The algorithm for scheduling AuB is given below, and is 
an e-approximation for the cases studied in this section. 
Since this is the critical case, the analysis is promoted 
from a lemma to a proposition. 

Proposition 1 Let e < 1/7. In the setting of this sec- 
tion, Algorithm 2 yields a schedule S, such that: 

E[COST(S)] < (1 + 5e) OPT(A U B) 



Algorithm 2 Scheduling A and B 

for x = l..(W ■ T(e)\A\ 2 - 1) do 

. Compute an optimal periodic schedule S a of A 
with density a = x/(W ■ T(e)\A\ 2 ) and period < 
T(e)\A\ 2 . 
. Choose Qo which minimizes: 

COST(S ao ,^) + LB(B,l -a ) 
. Compute the r* that minimizes LB(B, 1 — a ); Add 
a dummy group Go with po = c = and r *, such 
that: 1/to* = (1 - a )W - £ jeB 1/t/. 
Output: 

for t — I..00 do 

. Schedule during time slot t, the same messages 
of A on the same channels, as in S ao ■ 
for all empty slot s during time slot t do 

. Draw a group Gj of B with probabil- 
ity l/(r*(l — ao)W). Schedule in slot s, the next 
message of Gj in Round Robin order, if j =fc 0; 
and stay idle otherwise. 



Proof sketch. The proof works in two steps: 

1. Scheduling the messages of B with the randomized 
algorithm 1 in the empty slots achieves a good 
approximation of COST(S' ao , A) + LB(B, 1 - a ) 
(Using the mapping lemma 10). 

2. COST(S ao , A) + LB(B, 1 - ao) is a good approx- 
imation of the optimal cost (Using Lemma 3). 

□ 



5. C — THE NEGLIGIBLE MESSAGES 

In this section, to show how to incorporate the "neg- 
ligible" messages into the schedule. We assume that 
the set of messages M is partitioned into two sets AB 
and C, where C has a "negligible contribution" to the 
cost. (This section can be skipped by the reader who is 
in a hurry). 

Definition 1 A subset of messages C C M has negli- 
gible contribution if its contribution to the lower bound 
is 0(e), when it is scheduled on one channel with den- 
sity 0(e/C), i.e.: 

LBvk=i(C,£/(10C)) sg 3eOPT(M)/10 
The constants 1/10 and 3/10 are arbitrary and are cho- 
sen in order to improve readability in the following re- 
sults. 

Basically, a subset of messages C is negligible if its con- 
tribution to the cost is small, in the schedule constructed 
by inserting its messages are inserted from time to time 
(every 0(l/e) steps) into a schedule of the rest of the 
messages. 

Lemma 4 Consider a set of messages M, partitioned 
into two sets AB and C , where C has a negligible con- 
tribution to the cost. Then, for any schedule S of AB, 
one can construct in linear time a random schedule S' 
of M, such that: 

E[COST(S")] 

< (1 + ^) COST(5, AB) + f§ OPT(M) 

If COST(S, AB) s; OPT(M), this is a (1 + e)- 
approximation. 

PROOF. We first construct from S another sched- 
ule S*i of AB by inserting an empty slot, on all the 
channels, every lOC/e — 1 slots, starting at a random 
point in {0, . . . , WC/e — 2}. The stretching lemma 8 en- 
sures that: E[COST(5i)] «S (1 + s/W)COST(S,AB). 
Let t* be the solution to the minimization prob- 
lem LB w =i(C,e/(10C)). We obtain S' by scheduling 
the messages of C on the first channel in the empty 
slots of Si, according to the randomized algorithm 1 
with r = 10r*C/£. Lemma 2 and the scaling lemma 9 
ensure that the expected contribution of C is bounded 
by 3 LB(C, e/(10C)) s£ 9e OPT(M)/10. □ 

Remark 2 The algorithm above can easily be deran- 
domized by trying all the starting point and choosing 
the one that minimizes the over cost for the messages 
of AB and use the greedy algorithm 4 to schedule C. 

6. PTAS FOR DATA BROADCAST 

We now assume that we are in the general case. The 
aim of the section is to prove the following theorem, 
which is the main result of the paper. 

Theorem 1 (PTAS) Given e < 1/7 and a set M of 

messages, with message costs bounded by C, Algorithm 6 



constructs in 0(m 2 ) time a periodic schedule S with pe- 
riod < (m 2 + mmax(l,C))/e, so that: 

COST(S) < (1 + lie) OPT(M) 

We will first derive a PTRAS that will be derandomized 
in Section 6.2. 

6.1 Randomized 

We now need to put together the ideas developed for 
the special cases of the previous sections. As a prelim- 
inary treatment, we use standard rounding techniques 
to reduce the number of different messages. 

Lemma 5 (Rounding) Without loss of generality, we 
can assume that the request probabilities pt are a mul- 
tiple of powers of 1/(1 + e) and the broadcast costs are 
multiples ofe/W: 

Pi = r/(l + e) 3 , for some j > 1 

d = k ■ e/W, for some k € {0, . . . , C ■ W/e} 

where 1 < r ^ 1 + e. 

PROOF. Standard and omitted. □ 

The following lemma is the main tool for putting to- 
gether the various special cases studied so far, and is 
thus a key part of our construction. We would like to 
claim that similar ideas could be applied to other prob- 
lems as well, however we were unable to abstract simple 
and general ideas from the technical proof. Perhaps, if 
one believes that every approximation scheme rests on 
one "structural lemma" , it can be seen as the structural 
lemma for this problem. 

Lemma 6 (Partition) Given e > and k > 0, 
one can construct, in linear time in m, a partition of 
the groups (Gj t k), of messages with probability r/(l + 
e) J (where r is the normalizing constant such that 

£{^i;fc=o..cw/ e } r flW( 1 + = 1 ) and cost ke/W, 
into three sets A,B,C so that: 

1. The groups of A have total size constant: \A\ =def 
Sg- k eA9i,k = O s , K} c,w(l), independent of m. 

2. The groups of B are all large: 

V(G 3 , fe €B),g j:k > k\A\ 2 

3. The messages in C have negligible contribution if 
they are scheduled rarely (with density 0(e/C)): 

LB W=1 (C :T ^)^ f§OPT(M) 

Proof sketch. Since the proof is rather technical, 
we will only in this extended abstract give the construc- 
tion of the partition into A, B and C in the case when 
there are no costs (C = 0) and there is only one broad- 
cast channel (W = 1); this already contains the gist of 
the proof. 

Let a = (l + e) _1 <l. In the case where there are no 
costs, the lower bound can be solved explicitly (see [3; 
6]) even when there is a density constraint, to yield, for 
any subset X of the message set: 

LB(X,a) = £& G . ex g ja i/ 2 ) 2 



The construction. The construction is best understood 
by referring to figure 1. We first deal with indices such 
that gj ^ aT ] ^ . Let jo be some constant to be defined 
later, and define Ci = {(j : j > jo, gj ^ a~ J / 4 }, and 
— {j '■ j ^ jo, gj ^ a~ J / 4 }. (One can observe al- 
ready that since the contributions of the messages of C\ 
form the tail of a geometrically decreasing series, they 
will be negligible, and so they will end up in C; more- 
over, since j and gj are both bounded for the definition 
of A\, set A\ can only contain a small number of mes- 
sages and so these messages will end up in A). 
We now consider the more delicate case of the groups 
for which gj > a^^ A , for which we will need to use the 
pigeon hole principle. We partition their indices into 
(20/e) blocks as follows: 

Ai = {(j,k) :^^j <fi 2 }, 
A 2 = I:././-'!://' j /'''I---- 

A 2 o/ £ = {(j,fc):M 2 ° /e ^i<M 1+20/e }, 

where /i is some constant to be defined later. Ac- 
cording to [3], we can then rewrite the lower bound 
on the expected response time as ^/2LB{M)/r — 

Y,j9ja j/2 > J2hJ2jeA h 9ja j/2 , and the pigeon hole 
principle tells us that there exists at least one h such 
that J2,eA h 9j a3/2 «S ^oV 2LB ( M )/ r - We now de- 
fine A 2 = {j : j < n h , gj > a~ j/4 }, C 2 = A h , and 
/' !./:./ /''•• 9i >a" i/4 }. 

Finally we set A = A\ U A 2 and C — C\ U C 2 as shown 
on Figure 1. 




Figure 1: The partition. 

□ 

It is now a simple matter to take our building blocks 
and deduce a randomized approximation scheme for the 
general Data Broadcast problem. 

Proposition 2 (PTRAS) Given < e < 1/7, the 
randomized algorithm 3 yields a random schedule S with 
cost: 

E[COST(S)] < (1 + lOe) OPT(M) 



Algorithm 3 A PTRAS 



Algorithm 4 Greedy Algorithm 



. Round the probabilities and costs of the messages 
in M, and partition the set of messages M into three 
sets A, B, C, according to Lemma 6 with k = n(e). 
. Schedule A and B with algorithm 2. 
. Insert the messages of C into the schedule of A 
and B, with the algorithm described in Lemma 4. 



Proof. Consider the rounded instance M of the set 
of messages. According to the previous Proposition 1 
and Lemma 4, we have: 

E[COST(5)] s: (1 + e)(l + he) OPT(M) 
But Lemma 5 ensures that: 

OPT(M) sC (1 + 3e) OPT(M) 
which yields the result. □ 



Note 1 The insertion of C can be done at the same 
time than the broadcast of A and B in Algorithm 3. 

6.2 Derandomization 

The PTRAS has one slight problem, namely, that it 
is not periodic, hence may be somewhat awkward to 
implement in some settings. In this section we deran- 
domize it using greedy choices, and show how to control 
the period of the resulting algorithm. 



Definition 2 (State) We define the state 
( s 5,fc)i^3^g,i^'=^9j at s l°t t as the time period elapsed 
from the beginning of the k th of the gj last broadcasts 
of group Gj to the end of slot t, as shown Figure 2. 



last gj = 4 broadcasts of a message in Gj 

f ~ -v. Slot t 

^ ^ lh^^ 



Figure 2: Definition of the state at time slot t. 



. Add a dummy group Go, if needed, 
for t — l..oo do 

. Let (sj t k) be the state at time slot t — 1. 

. Let j € {0, ■ ■ ■ ,q} which minimizes: 
(cj -PjTjJ2k=iSj,k) 

. Schedule during slot t, the next message of Gj 

in the Round Robin order, if j / 0, and stay idle 

otherwise. 



Proof sketch. The greedy choice at time slot t is 
made in order to minimize the expected cost of the al- 
ready allocated slots 1, . . . , (t— 1), if the schedule contin- 
ues with the randomized algorithm 1 after time t; this 
property ensures that the greedy schedule is at least as 
good as the randomized one. □ 

The above greedy algorithm could conceivably have very 
large period. The lemma below shows that we can trun- 
cate it so as to obtain a periodic schedule of polynomial 
length. 

Corollary 1 (Greedy periodic schedule) Given a 



set of messages partitioned into groups G\ 



,G q of size 



gj, a set of reals Tj > such that J2j V T j ^ W , and 
any T ^ (8m 2 + (AC — l)m), Algorithm 5 yields a one- 
channel schedule S with period (T + 2m), whose cost is 
bounded by: 



COST(S') < 



j=i 



Pi 



9j(gj + i) 



Algorithm 5 A periodic greedy algorithm 



(rn + 



. Schedule during slot t = l..m message M t . 
. Execute the greedy algorithm during slots t ■ 
l)..(T + m). 

. Sort in increasing order the set {krj : 1 ^ j ' ^ q; 1 ^ 
k ^ gj} and Schedule in slots t = (T+m+l)..(T+2m) 
in order of increasing krj, the k th message of group 
Gj in the Round Robin order. 



Proof. Omitted. □ 
Our main algorithm can now be found in Algorithm 6. 



Lemma 7 (Derandomization of Algorithm 1) 

Given a set of messages partitioned into groups 
Gi, . . . ,G q of size gj, and a set of reals Tj > so 
that y\ 1/tj ^ 1, the greedy algorithm 4 yields a 
one-channel schedule S whose cost satisfies: 

COST( 5 )^( P ,^L±i) r , + |)_l 

If t minimizes LB(M, 1), we get a maxj(l + 1/gj)- 
approximation. 



Proof of Theorem 1. Theorem 1 is proved by ana- 
lyzing the algorithm 6. The analysis is derived from the 
analysis of the PTRAS. The six first steps are exactly 
the same, except that the periodic greedy algorithm 5 is 
used instead of the randomized algorithm 1. Since the 
performance ratio in Algorithm 5 is better, the sched- 
ule S obtained Step 6 is at least as good, and is periodic 
with period 0(m 2 ): 

COST(S) < (1 + lOe) OPT(M) 
We finally reduce the period in Steps 7-8 by using 
stretching lemma 8, which ensures that at an increase of 
(1 + 0(e)) of the cost, we can extract from 5 a block S* 

with length < m2+m 7 x(1 ' c) and: 



Algorithm 6 The PTAS 

1. Round the probabilities and costs, and partition M 
into A, B, C as in the PTRAS. 

2. Compute r* and the density ao and periodic sched- 
ule S a „ of A to minimizes LB(_B, (1 — ao), as in 
Algorithm 2. 

3. Compute the greedy periodic schedule Sb of B with 
r = (r*(l - a )W) and with period {T(e)|A| 2 (l - 
a )W{8m 2 + {AC + l)m)} = 9(m 2 ). 

4. Concatenate (8m 2 + (4C + l)m) periods of S ao and 
map Sb into the empty slots in the natural order. 

5. Compute the greedy periodic schedule Sc of C 
with r = (WCt*/e) where r* minimizes 
LB(C,e/10C), and with period {T(e)\A\ 2 (8rn 2 + 
(4C + l)m)e/10C} = 6(m 2 ). 

6. Choose the best starting point in {1, . . . , — 1} 
and stretch the schedule of A and B by inserting a 
slot of Sc on the first channel every (i|£ — 1) and 
an empty slot on the other channels at that time. 
Let S be the resulting schedule. 

7. Choose the best starting point in {1, . . . , m \ mC — 
m} and construct S' by stretching S by inserting 
the m messages in fixed order on the first channel 
every ( ™ 2 +™c - m ). 

8. S" is then structured into independent blocks of 
length m2 + mC . The cheapest block S* will be the 
period of our approximation. 



COST(S") < (1 + lie) OPT(M) 

□ 

7. TECHNICAL LEMMAS 

The lemmas in this sections are useful for analyzing sev- 
eral of our constructions. The stretching lemma states 
that changing a schedule by inserting a few empty slots 
once in a while does not affect the expected response 
time. 

Lemma 8 (Stretching) Given a schedule S on 
W channels of M and a positive integer y, let 
k ^ y_±M. _ y Consider the schedule S' obtained 
from S by inserting y empty slots just before the time 
slots x, x + k, . . . , x + i ■ k, . . ., where x is a random time 
in {1, . . . , k}. Then: 

E[ERT(S')] ^ (1 + e) ERT(S) 

Proof. Omitted. □ 

The scaling lemma is immediate. 

Lemma 9 (Scaling) Given a set of messages M and 
a schedule S, let S a the schedule obtained by scaling 
S by a factor 1/a: S a schedule at time t/a on some 
channel the same message as S at time t, and stays idle 
otherwise. Then: 

ERT(S a ,A) = i ■ ERT(S, A) 
BC(S a ,A) = a-BC{S,A) 



Proof. Immediate. □ 

The mapping lemma is used for analyzing the effect of 
inserting the messages from B into the slots left empty 
in the density-constrained schedule of A; these slots may 
be spaced irregularly. 

Lemma 10 (Mapping into reserved empty slots) 

Given a set of messages M, partitioned into groups of 
identical messages, such that all groups are larger than 
TW, consider a one- channel schedule S of M schedul- 
ing each group in Round Robin order, and a periodic 
sequence of reserved time-slots over W channels with 
density a and period T. Let S' be the schedule obtained 
by mapping the schedule S into the reserved empty slots 
from left to the right, then: 

ERT(S',A) < -L ■ ERT(S, A) + T ]T Pl 

M t eA 

BC{S',A) = aW-BC(S,A) 
Proof. Omitted. □ 

Corollary 2 (Case of large groups) In the case 
where M is partitioned into groups of size ^ 2T/s, 
S' has a cost bounded by: 

COST(S",^) sj 

(1 + e){ ^ • ERT(S* A) + aW ■ BC(S, A) j 

Proof. Simple calculation. □ 
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